Speech and speech recognition during dictation corrections

نویسنده

  • Keith Vertanen
چکیده

A natural way to correct errors made while dictating to a computer is to respeak portions of the original sentence. But often spoken corrections are themselves misrecognized, costing the user time and testing their patience. To better understand how users behave while correcting, I created a simulated dictation interface and fooled users into believing they were correcting errors by respeaking. I found that users not only hyperarticulate during corrections, but they do so preemptively before any misrecognition. Depending on the recognizer, hyperarticulation was found to cause relatively minor changes in error rate. The correction of isolated words or phrases was more troublesome, causing substantial recognition problems for an HTK recognizer. Dragon Naturally Speaking, on the other hand, performed slightly better on hyperarticulated speech and only degraded slightly on isolated corrections.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

How productivity improves in hands-free continuous dictation tasks: lessons learned from a longitudinal study

Speech recognition technology continues to improve, but users still experience significant difficulty using the software to create and edit documents. The reported composition speed using speech software is only between 8 and 15 words per minute [Proc CHI 99 (1999) 568; Universal Access Inform Soc 1 (2001) 4], much lower than people’s normal speaking speed of 125–150 words per minute. What caus...

متن کامل

Contribution of a speech recognition system to a computerized pneumonia guideline in the emergency department

OBJECTIVE Evaluate the effect of a radiology speech recognition system on a real-time computerized guideline in the emergency department. METHODS We collected all chest x-ray reports (n = 727) generated for patients in the emergency department during a six-week period. We divided the concurrently generated reports into those generated with speech recognition and those generated by traditional...

متن کامل

A Multimodal Approach to Dictation of Handwritten Historical Documents

Handwritten Text Recognition is a problem that has gained attention in the last years due to the interest in the transcription of historical documents. Handwritten Text Recognition employs models that are similar to those employed in Automatic Speech Recognition (Hidden Markov Models and n-grams). Dictation of the contents of the document is an alternative to text recognition. In this work, we ...

متن کامل

Syllable Analysis to Build a Dictation System in Telugu language

In recent decades, Speech interactive systems gained increasing importance. To develop Dictation System like Dragon for Indian languages it is most important to adapt the system to a speaker with minimum training. In this paper we focus on the importance of creating speech database at syllable units and identifying minimum text to be considered while training any speech recognition system. Ther...

متن کامل

Question Answering in an Oral Dialogue System

In this paper we describe how a computer dictation system can answer certain questions from a physician. Using speech recognition, dictation systems can automatically generate medical reports. Our dictation system, DictaMed, models the medical entities using an object oriented representation. These entities are created along with the dictation when they are mentioned by the physician. The dicta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006